FIGURE 1 DOCUMENT CORPUS 



FIGURE 2 DICTIONARY 



DOC 1 : Hello, my name is Fred. 1 . name 

DOC2; Hello, my name is Scott. 2. fred 

D0C3 ; Scott says, "Live and let live." 3. scott 

4. Live 



FIGURE 3 DENSE MATRIX - INTEGER FORMAT 
(PRIOR ART) 

110 0 fSpace required : 

10 10 16 short ints at 2 bytes each = 32 bytes) 

0 0 12 



FIGURE 4 DENSE MATRIX - FLOATING POINT NUMBER FORMAT 

(PRIOR ART) 

0.707 0.707 0.0 0.0 (" Space required : 

0.707 0.0 0.707 0.0 12 floats at 4 bytes each 

0.0 0.0 0.447 0.894 = 48 bytes') 



(Note: 0.707= 1*1/(1^+12) '^2; 0.447 = 1*1/ (1' + 2-)"- ; 0.894 = 2*1/ (1^ + 2-)"^) 



FIGURE 5 SPARSE MATRIX - FLOATING POINT NUMBER FORMAT 

(PRIOR ART) . ... 



(1 0.707) (2 0.707) 
(I 0.707) (3 0.707) 
(3 0.447) (4 0.894) 



( Space required : 

6 short ints & 6 floats 

= 6*2 + 6*4 = 36 bytes) 



FIGURE 6 SMALL SPARSE MATRIX - FLOATING POINT NUMBER FORMAT 

1, 2: 0.707 rSpace required : 

1,3:0.707 7 short ints & 3 floats 

3,4,4: 0.447 = 7*2 + 3 *4 = 26 bvtes) 

(Note: 0.707 = 1/ (1^ + i^^; 0.447 = 1/ + 2^) "2) 



FIGURE 7 SMALL SPARSE MATRIX IN VECTOR FORM 



ALLDATA = 1 2 1 3 3 4 4 
STARTMARKER = 1,3,5 
MULT = 0.707 0.707 0.447 



START 



DEVELOP DICTIONARY 
FOR DOCUMENT CORPUS 
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ASSIGN INTEGER 




TO EACH 


803 


DICTIONARY TERM 
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DEVELOP ALLDATA VECTOR 
CONVERTING DOCUMENTS 
INTO A LISTING OF INTEGERS 
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FIGURE 8 



DEVELOP 
STARTMARKER 
VECTOR 
LISTING DOCUMENT 
STARTPOINTS 



DEVELOP 
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VECTOR 
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FIGURE 9 

( START ) 
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GET WORD 
FROM DOCUMENT 



CALCULATE NF 
FOR DOCUMENT, 
ADD TO MULTS 
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